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Complex sound like speech can be characterized as the sum of number of amplitude-modu- 
lated signals representing the outputs of an array of narrow frequency bands. Temporal infor- 
mation at the output of each band can be separated into temporal fine structure (TFS), the 
rapid oscillations close to the center frequency and temporal envelope (ENV), slower amplitude 
modulations superimposed on the TFS. TFS information can be carried in the pattern of phase 
locking to the stimulus waveform, while ENV by the changes in firing rate over time. The relative 
importance of temporal ENV and TFS information in understanding speech has been studied 
using various sound-processing techniques. A number of studies demonstrated that ENV cues 
are associated with speech recognition in quiet, while TFS cues are possibly linked to melody/ 
pitch perception and listening to speech in a competing background. However, there are evi- 
dences that recovered ENV from TFS as well as TFS itself may be partially responsible for 
speech recognition. Current technologies used in cochlear implants (CI) are not efficient in deliv- 
ering the TFS cues, and new attempts have been made to deliver TFS information into sound- 
processing strategy in CI. We herein discuss the current updated findings of TFS with a litera- 
ture review. Korean J Audiol 2014;18(1):1-7 

KEY WORDS: Temporal fine structure ■ Temporal envelope ■ Speech perception ■ Hearing 
loss. 



Introduction 

Multiple acoustic cues are usually used to interpret and un- 
derstand speech in the human auditory system. These acous- 
tic cues are largely classified on the basis of their temporal and 
spectral properties. The temporal properties can be flirther clas- 
sified into the magnitude of the Hilbert transform on the speech 
signals and its phase information. The basic representation of 
speech stimuli at all levels of the auditory system is tonotopic, 
meaning that different frequencies in the stimulus are analyzed 
separately. Thus, spectral information of speech is processed 
as a result of cochlear filtering, which is a kind of series of 
bandpass-filtered signals. Each filtered signal corresponds to 
one specific place on the basilar membrane. The time signal at 
a specific position on the basilar membrane can be decomposed 
by using the Hilbert transform into two forms of temporal in- 
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formation, temporal envelope (ENV) and temporal fine struc- 
ture (TFS)(Fig. 1). The ENV is characterized by the slow vari- 
ation in the amplitude of the speech signal over time, while TFS 
is the rapid oscillations with rate close to the center frequen- 
cy of the band." Both ENV and TFS information are repre- 
sented in the timing of neural discharges. It is commonly be- 
lieved that ENV cues are represented in the auditory system 
as fluctuations in the short-term rate of firing in auditory neu- 
rons, while TFS is represented by the synchronization of nerve 
spikes to a specific phase of the carrier (phase locking).*' Al- 
though the upper limit of phase locking in humans is unknown 
yet, phase locking begins to weaken for high frequencies above 
4—5 kHz in most mammals. 

It has been of considerable interest to understand the rela- 
tive contribution of these acoustic cues for speech perception. 
A number of studies have investigated the roles the cues to 
understand which are particularly important for speech per- 
ception in quiet and in complex backgrounds. This paper 
will especially describe the role of TFS information in speech 
perception and its clinical applications. 
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Original sentence 




Fig. 1 . A: Time waveform of a sentence spoken by a male speaker. B: The envelope (in red line) is superimposed on the original sentence 
signal. The broadband envelope of the sentence is plotted. C: The envelope in this example was determined by the absolute value of Hu- 
bert transformation followed by the lowpass filter at 200 Hz. Temporal fine structure (TFS) is determined by the cosine values of the angle 
of analytic signal of the sentence. Note that the TFS shows no fluctuation over time, while the envelope shows slow fluctuation over time. 
The original sentence signal can be derived by modulating the TFS signal with the envelope. 



Techniques to Evaluate the Effect of 
Acoustic TFS on Speech Perception 

To investigate the role of TFS in speech perception, a variety 
of techniques have been developed and applied. The main 
concepts of these techniques are splitting sound into multiple 
frequency bands and processing the signal in each band to 
preserve only ENV or TFS cues, or varying TFS information 
in each band. 

Smith, et al."" first developed "speech chimeras", which have 
the ENV of one sound and the TFS of another, to investigate 
the relative perceptual importance of ENV and TFS in differ- 
ent acoustic situations. A bank of band-pass filters is used to 
split each sound, and the output of each filter is factored into 
its ENV and TFS using the Hilbert transform. The ENV of each 
filter output from the first sound is then multiplied by the TFS 
of the corresponding filter output from the second sound, and 
these are summed over all frequency bands to produce "chi- 
meras". That is, the chimeras are made up of the ENV of the 
first sound and the TFS of the second sound in each band. Us- 
ing speech chimeras, the authors revealed that TFS is impor- 
tant for pitch perception and sound localization. 

Other way to study the role of TFS is to remove ENV cues in 
the speech as far as possible while TFS cues are preserved.'" 
Thus, "TFS-speech", which contains TFS information only. 



have been developed and used for investigating the role of 
TFS on speech perception. The Hilbert transform was used 
to decompose the signal in each frequency band into its ENV 
and TFS components. Then, the ENV component was re- 
moved. As a result, each band signal becomes like an frequen- 
cy modulations (FM) sinusoidal carrier, with a constant am- 
plitude. The TFS in each band was multiplied by a constant 
equal to the root-mean square amplitude of the original band 
signals. The "power- weigh ted" TFS signals were then com- 
bined over all frequency bands. 

Although this precise processing preserves TFS information 
to some extent, the TFS information can be distorted from the 
original TFS in the unprocessed speech. In addition, Ghitza'^* 
showed the recovered ENV cues from TFS-speech, which are 
reconstructed at the outputs of the auditory filters. This recon- 
structed ENV cues may influence the intelligibility of TFS- 
speech. After that, Lorenzi, et al.*" showed evidence indicating 
that only minimal influence was observed from reconstructed 
ENV cues when the bandwidth of the analysis filters was nar- 
row (less than 4 equivalent rectangular bandwidth). 

A series of recent studies by Hopkins, et al."*''''^' used a dif- 
ferent approach to evaluate the contribution of TFS to speech 
perception in background noise. They measured speech recep- 
tion thresholds in noise as a function of the number of chan- 
nels containing TFS information. Signals were left unprocessed 
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for channels up to a certain frequency limit (i.e. both ENV and 
TPS information are preserved) and were vocoded for higher 
frequency channels (i.e. only ENV information is preserved). 
As the cut-off channel increased, the amount of TPS informa- 
tion available also increased. 

The Role of TFS on Pitch Perception 

Pitch is important for speech and music perception as well 
as segregating sounds that arrive from different sources. If fiin- 
damental frequency (PO) and its perceptual correlate— pitch— 
of simultaneous sounds are same, the sounds are more likely to 
be heard as a single object. Thus, patients with cochlear hear- 
ing loss or cochlear implant (CI) users who have poorer pitch 
perception ability may have some difficulties in understanding 
or segregating speech in complex everyday acoustic environ- 
ments.''" Previous studies over many years suggests that TPS 
plays a role in the pitch perception for both pure and complex 
tones. Especially, Smith, et al.'°' reported a significant role of 
TPS information on pitch perception of complex harmonic 
sounds by using chimaeras. They synthesized chimaeras based 
on two different melodies, one in the ENV and the other in the 
TPS. Up to 23 frequency bands, listeners always heard the 
melody based on the TPS, while the ENV-based melody was 
more often heard with 48 and 64 frequency bands. This result 
suggests that melody reception depends primarily on TPS in- 
formation in broad frequency bands. In contrast, when using 
speech-speech and speech-noise chimaeras, the crossover 
point, where the envelope begins to dominate over the fine 
structure, occurs for a much lower number of frequency bands. 
Thus, in contrast to pitch perception, speech recognition main- 
ly depends on ENV information in broad frequency bands. 

The Role of TFS on Speech 
Recognition in Noise 

ENV cues are sufficient to give good speech intelligibility 
in quiet, but they are not enough in the presence of background 
noise. That means ENV cues alone are not suflScient to percep- 
tually segregate mixtures of sounds. A number of previous 
studies have suggested that TPS is required for speech percep- 
tion in noise, particularly fluctuating noise, such as a com- 
peting talker.'''^' 

It is usually easier to detect a speech in modulated than in 
steady noise condition, especially when the frequency be- 
tween the signal and noise is different. This effect has usually 
been ascribed to the ability to "dip listening" of the fluctuating 
background sound. Moore and Glasberg""' have demonstrat- 
ed that TPS provides a cue that allows effective dip listening 



when the masker and signal frequencies fall in the range 
where phase locking is relatively precise. However, for the 
highest masker center frequency where phase locking is weak 
or absent, the difference of thresholds in the modulating 
masker and in the steady masker (masking release) became 
smaller, at about 10 dB. This dramatic decrease in masking 
release for the masker centered at high frequencies is consis- 
tent with the idea that the mechanism that decodes TPS infor- 
mation is less effective when there are rapid changes in TPS. 
The other study regarding masking experiments has also been 
interpreted as indicating a role of TPS in dip listening.'^' To 
assess the role of TPS information in dip listening, Hopkins, 
et al."* '" used a different approach to assess the use of TPS in 
speech perception. They measured speech recognition thresh- 
olds in steady and modulated noise for signals that were fil- 
tered into 32 bands and tone vocoded above a variable cut-off 
band. Piltered sound below the cut-off band contained intact 
TPS and ENV information, while the other bands were noise 
or tone vocoded, so only ENV information was conveyed in 
those bands above the cut-off band. As expected, normal- 
hearing listeners showed an improvement in speech percep- 
tion (by about 15 dB in speech reception thresholds) in noise 
as more TPS information was added. In addition, the im- 
provement in speech reception was larger for the modulated 
than steady noise condition. This result is in consistent with 
the concept TPS information plays a significant role in the 
ability to identify speech in a fluctuating background. 

Reduced Ability to Use TFS 
in Hearing-Impaired Listeners 

Hopkins, et al.^' directly assessed the ability to use TPS in- 
formation in hearing-impaired listeners. Speech reception 
thresholds in a competing talker background were assessed 
using signals with variable amounts of TPS information in 
normal-hearing and hearing-impaired subjects. Hearing-im- 
paired group showed less benefit from the additional TPS in- 
formation than normal-hearing group. In addition, there are a 
lot of individual variability regarding the benefit in hearing- 
impaired listeners. This reduced ability to use TPS informa- 
tion in hearing-impaired group may be partially responsible 
for decreased ability of listening in a fluctuating background 
than normal-hearing group. 

Another experiment by Hopkins and Moore"*' measured 
ability to use TPS by discriminating a harmonic complex tone 
from a tone in which all components were shifted upwards 
by the same amount in Hz. Non-shaped stimuli, containing 
five equal-amplitude component, or shaped stimuli, contained 
many component and passed through a fixed bandpass filter 
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to reduce excitation pattern changes, were used in the experi- 
ment. The performance for the shaped stimuli in hearing-im- 
paired group was much poorer than that in normal-hearing 
group, suggesting that they could not use the TFS information 
effectively. However, the performance for the non-shaped stim- 
uli in hearing-impaired group was significantly improved, 
suggesting that the hearing-impaired listeners may benefit 
from spectral cues, hi contrast to hearing-impaired listeners, 
normal-hearing subjects are able to use TFS information to 
discriminate inharmonic and frequency-shifted complexes with 
components that are unresolved. 

Some previous studies regarding speech perception have 
also investigated the effect of cochlear hearing loss in under- 
standing speech with emphasis on TFS information. Lorenzi, 
et al." designed an experiment using unprocessed, ENV- 
speech, and TFS-speech in quiet for three groups: young with 
normal hearing, young with moderate hearing loss, and elder- 
ly with moderate hearing loss. Normal hearing listeners can 
understand all three types of speech with more than 90% 
correct rates. Performance for unprocessed and ENV-speech 
in both young and elderly listeners with hearing loss are al- 
most similar to normal hearing group. However both groups 
with hearing loss performed very poorly with TFS-speech, 
suggesting that cochlear hearing loss negatively affect the 
ability to use TFS. Hopkins, et al."" also reported similar results 
that the hearing-impaired group performed more poorly than 
the normal-hearing group in all test conditions, and the differ- 
ence between the two groups becomes greater with addition 
of TFS information. 

Moore" proposed several possible reasons regarding the 
reduced ability to use TFS information in hearing-impaired 
listeners. First, cochlear hearing loss can cause reduced syn- 
chrony capture (reduced precision of phase locking), possi- 
bly as a result of diminished two-tone suppression. Second, 
the relative phase of response at different points along the 
basilar membrane can be changed in hearing-impaired listen- 
ers, which could influence mechanisms for decoding TFS. 
Third, due to broadened auditory filters, more complex and 
rapidly-varying TFS are generated, which might be difficult 
for central mechanisms to decode the TFS information. Fourth, 
a shift in frequency-place mapping caused by hearing loss 
may disrupt the decoding process of TFS information. Fifth, 
there may be central changes following hearing loss, such as 
loss of inhibition, and this might make it more difficult to de- 
code the TFS information. 

Reconstructed ENV from TFS 

There have been debates about the real role of TFS informa- 



tion in speech perception using a vocoder study. Some research- 
ers have argued that the TFS role for speech perception have 
been somewhat overestimated because the ENV cues can be 
reconstructed from TFS cues in the human auditory filters. 
Modelling work by Ghitza'^' has demonstrated that the degraded 
speech-envelope cues may be recovered at the output of audi- 
tory filters because the signal's envelope and instantaneous fre- 
quency information are related. Thus, it is still unclear how TFS 
information contributes to speech intelligibility. Besides phase- 
locking in auditory-nerve fibers, these recovered ENV cues may 
be responsible for the speech intelligibility in a real world.'"" 
Gilbert and Lorenzi^"' assessed the contribution of recov- 
ered ENV cues to speech perception as a function of the anal- 
ysis bandwidth in normal hearing listeners. In this study, ENV 
cues recovered at the output of auditory filters can be used by 
listeners to identify consonants especially when the bandwidth 
is broad. However, when the analysis bandwidth was narrow- 
er than four times the bandwidth of a normal auditory filter, the 
recovered ENV cues did not play a significant role in speech 
perception. For patients with mild to moderate hearing loss, co- 
chlear damage reduces the ability to use the TFS cues of speech, 
and this is partially due to poor reconstruction of ENV from 
TFS.'" 

Won, et al.'^' evaluated the mechanism of recovered ENV 
in CI users. Stimuli containing only TFS information were 
created using 1, 2, 4, and 8-band FM-vocoders. A consistent 
improvement was observed as the band number decreased 
from 8 to I (as more ENV were reocovered). This result indi- 
cates that CI sound processor can generates recovered ENV 
cues from broadband TFS, and CI users can use the recovered 
ENV cues to understand speech. 

Cochlear Implant and Temporal 
Fine Structure 

Several sound processing strategies have been developed 
over time for multi-channel implants. One of the most fre- 
quently used strategy in implant signal processing is the con- 
tinuous interleaved sampling (CIS). In the CIS, sound is passed 
through a bank of bandpass filters, and the ENV of each filter 
output is obtained via rectification and lowpass filtering. How- 
ever, during the lowpass filtering of the envelope, TFS cues 
are largely discarded, and this is one of the important limita- 
tions of sound processing strategies in CI today. That is, cur- 
rent CI systems convey mainly ENV information in different 
frequency bands. Furthermore, ENV cues are known to be more 
susceptible to noise degradations than intact speech. Over de- 
cades, speech recognition of CI users has improved dramati- 
cally. However, the performance of speech perception by CI 
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users is still not comparable to that of normal-hearing listen- 
ers, and this difference in performance is especially noticeable 
for speech perception in noise as well as music perception. 
Therefore, the lack of TFS cues in conventional CI encoding 
strategies reduces temporal pitch cues and may be responsi- 
ble for the generally poor performance of understanding 
speech in noise and music in CI users. 

This hypothesis has given motivation to develop new im- 
plant signal processing strategies to contain TFS cues in ad- 
dition to ENV cues.^^'^^' However, most CI users have the re- 
duced sensitivity to temporal modulation in electric hearing. 
Changes in the repetition rate of the electric waveform above 
approximately 300 Hz cannot be processed in most CI users, 
while TFS typically oscillates at a much higher rate.^" To over- 
come this limitation in delivering TFS information to CI us- 
ers, several approaches have been attempted. The HiRes strat- 
egy uses a relatively high envelope cutoff frequency and pulse 
rate to improve TFS information. Nie, et al.,^* proposed a new 
strategy which encodes both amplitude and FM. In the strat- 
egy, by removing the center frequency from the subband sig- 
nals and additionally limiting the FM's range and rate, it trans- 
forms the rapidly-varying TFS into a slowly varying FM signal 
which was applied to the carrier in each band. They found that 
adding this FM signal improved performance by as much as 
71% for sentence recognition in babble noise. Analog strate- 
gies also can be used for TFS transmission, but it increases elec- 
trode interaction, which ultimately decrease CI users' percep- 
tual abilities. 

Recently, to encode TFS information for CI users, a har- 
monic-single-sideband-encoder (HSSE) strategy was devel- 
oped.^''"^' The main difference of HSSE from other strategies 
is that the HSSE use incoherent approaches to extract tempo- 
ral cues, e.g. the Hilbert envelope or half full wave rectifica- 
tion followed by a low-pass filter. It explicitly tracks the har- 
monics of a single musical source and transforms them into 
modulators conveying both amplitude and TFS cues to elec- 
trodes. Using an auditory nerve model, the neural spike pat- 
tems evoked by HSSE and CIS for one melody stimulus were 
simulated, and it was revealed that HSSE can convey more 
temporal pitch cues than CIS. In addition, timbre recognition 
as well as melody recognition were significantly improved 
with HSSE.^*"' The results suggest that adding TFS cues using 
a specific processing strategy in CI users can enhance music 
perception with CIs. 

Hearing Aid and Temporal 
Fine Structure 

The healthy cochlea responds to sound in a way that is high- 



ly nonlinear and compressive. That means more gain applied 
at low sound levels than at high ones due to outer hair cell flinc- 
tion. However, if outer hair cells are damaged, it can result in 
a more linear basilar membrane response. Consequently, pa- 
tients with hearing loss (usually with some degree of outer 
hair cell damage) typically have problems like loudness recruit- 
ment and a reduced dynamic range. Most current hearing 
aids use this non-linear compression for patients with cochle- 
ar hearing loss. That is, the gain applied to a signal is inverse- 
ly related with the signal input level, so intense sounds are less 
amplified than weak sounds. This compression schemes used 
in hearing aids are typically categorized as fast or slow acting, 
depending on the values of attack time and release time. Fast 
compression and slow compression have their own strengths 
and limitations. Overall, previous studies suggested that slow 
compression was better than fast compression based on mea- 
sures of listening comfort, whereas fast compression was bet- 
ter than slow compression based on measures of speech intel- 
ligibility.^'" However, there are large individual differences 
for overall outcome following use of hearing aids. 

Hopkins, et al.'" reported high individual variability to use 
TFS information among hearing-impaired listeners. Moore™' 
suggested that an individual's sensitivity to TFS cue may af- 
fect which compression speed gives most benefit. Hearing aid 
users with good TFS sensitivity may benefit more from fast 
than from slow compression, because TFS information may 
be important for listening in the dips of a fiuctuating back- 
ground and fast compression increases the audibility of signals 
in the dips. This hypothesis was tested in normal-hearing lis- 
teners using vocoded signals. In the experiments, however, 
speech intelligibility was significantly better for fast compres- 
sion than slow compression regardless of conditions. That 
means, the availability of original TFS or detailed spectral in- 
formation does not affect the optimal compression speed."^" 
To confirm this result, the same experiments should be extend- 
ed to patients with a real cochlear hearing loss. 

IVIethods for Determining Sensitivity 
to TFS in Clinics 

Recent evidence suggests that cochlear hearing loss adverse- 
ly affects the ability to use TFS information in a variety of con- 
ditions, such as lateralization of sounds, pitch perception, and 
speech perception. '''''*''^^' Thus, clinical needs for accurately as- 
sessing ability to use TFS cue have been increased. Moore 
and Sek^"^' developed a simple and quick test for assessing 
sensitivity to TFS that can be used in clinical setting. The test 
is designed to discriminating a harmonic complex tone (H), 
with a fundamental frequency FO, from a similar tone in which 
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all harmonics were shifted up by the same amount in hertz, 
which becomes an inharmonic tone (I). For example, tone H 
may contain components at 1600, 1800, 2000, 2200, 2400 Hz 
for an FO of 200 Hz, while tone I may contain components at 
1620, 1820, 2020, 2220, 2420 Hz. The starting phases of the 
components are chosen randomly for each stimulus. Both 
tones have an envelope repetition rate equal to FO, but the 
tones differ in their TFS. To reduce spectral cues, all tones are 
passed through a fixed bandpass filter, and the default value 
of the center frequency is 1 IFO. A background noise is used to 
mask combination tones. During the test, the amount of shift 
was varied using an adaptive procedure. They conclude that 
this new test provides a simple, quick and robust way to mea- 
sure sensitivity to the TFS of complex tones. 

Conclusions 

It has been widely believed that temporal ENV is most im- 
portant in speech intelligibility in quiet. However, temporal 
ENV information can be easily degraded in noise. As described 
above, TFS information can be usefiil for speech perception 
in background noise (especially in modulated noise) as well 
as pitch/melody perception. However, hearing-impaired lis- 
teners have reduced sensitivity to TFS, and fail to benefit from 
TFS information. Thus, in auditory rehabilitation field, attempts 
have been made to enhance the use of TFS information in hear- 
ing-impaired listeners. Compensation for TFS deficits in co- 
chlear hearing loss could be achieved with continuous research- 
es in the near ftiture. 

REFERENCES 

1) Moore BC. The role of temporal fine structure processing in pitch 
perception, masking, and speech perception for normal-hearing and 
hearing-impaired people. J Assoc Res Otolaryngol 2008;9:399-406. 

2) Joris PX, Yin TC. Responses to amplitude-modulated tones in the 
auditory nerve of the cat. J Acoust Soc Am 1992;91:215-32. 

3) Drullman R. Speech intelligibility in noise: relative contribution of 
speech elements above and below the noise level. J Acoust Soc Am 
1995;98:1796-8. 

4) Hopkins K, Moore BC, Stone MA. Effects of moderate cochlear 
hearing loss on the ability to benefit from temporal fine structure 
information in speech. J Acoust Soc Am 2008;123:1140-53. 

5) Hopkins K, Moore BC. The importance of temporal fine structure 
information in speech at different spectral regions for normal-hear- 
ing and hearing-impaired subjects. J Acoust Soc Am 2010;127: 
1595-608. 

6) Shannon RV, Zeng FG, Kamath V, Wygonski J, Ekelid M. Speech 
recognition with primarily temporal cues. Science 1995;270:303-4. 

7) Won JH, Drennan WR, Rubinstein JT. Spectral-ripple resolution 
correlates with speech reception in noise in cochlear implant users. 
J Assoc Res Otolaryngol 2007;8:384-92. 

8) Xu L, Thompson CS, Pfingst BE. Relative contributions of spectral 
and temporal cues for phoneme recognition. J Acoust Soc Am 2005; 
117:3255-67. 



9) Lorenzi C, Gilbert G, Carn H, Gamier S, Moore BC. Speech percep- 
tion problems of the hearing impaired reflect inability to use tempo- 
ral fine structure. Proc Natl Acad Sci U S A 2006;103:18866-9. 

10) Smith ZM, Delgutte B, Oxenham AJ. Chimaeric sounds reveal di- 
chotomies in auditory perception. Nature 2002;416:87-90. 

11) Zeng FG, Nie K, Liu S, Stickney G, Del Rio E, Kong YY, et al. On the 
dichotomy in auditory perception between temporal envelope and 
fine structure cues. J Acoust Soc Am 2004;116:1351-4. 

12) Ghitza O. On the upper cutoff frequency of the auditory critical-band 
envelope detectors in the context of speech perception. J Acoust 
Soc Am 2001;110(3 Pt l):1628-40. 

13) Hopkins K, Moore BC. The contribution of temporal fine structure 
to the intelligibility of speech in steady and modulated noise. J Acoust 
Soc Am 2009;125:442-6. 

14) Qin MK, Oxenham AJ. Effects of simulated cochlear-implant pro- 
cessing on speech reception in fluctuating maskers. J Acoust Soc Am 
2003;114:446-54. 

15) Oxenham AJ, Bernstein JG, Penagos H. Correct tonotopic represen- 
tation is necessary for complex pitch perception. Proc Natl Acad Sci 
US A2004;101:1421-5. 

16) Moore BC, Glasberg BR. Factors affecting thresholds for sinusoidal 
signals in narrow-band maskers with fluctuating envelopes. J Acoust 
Soc Am 1987;82:69-79. 

17) Schooneveldt GP, Moore BC. Comodulation masking release (CMR): 
effects of signal frequency, flanking-band frequency, masker band- 
width, flanking-band level, and monotic versus dichotic presentation 
of the flanking band. J Acoust Soc Am 1987;82:1944-56. 

18) Hopkins K, Moore BC. Moderate cochlear hearing loss leads to a re- 
duced ability to use temporal fine structure information. J Acoust Soc 
Am 2007;122:1055-68. 

19) Ernst SM, Moore BC. Mechanisms underlying the detection of fre- 
quency modulation. J Acoust Soc Am 2010;128:3642-8. 

20) Gilbert G, Lorenzi C. The ability of listeners to use recovered enve- 
lope cues from speech fine structure. J Acoust Soc Am 2006;1 19:2438- 
44. 

21) Ardoint M, Sheft S, Fleuriot P, Gamier S, Lorenzi C. Perception of 
temporal fine-structure cues in speech with minimal envelope cues 
for listeners with mild-to-moderate hearing loss. Int J Audiol 2010; 
49:823-31. 

22) Won JH, Lorenzi C, Nie K, Li X, Jameyson EM, Drennan WR, et al. 
The ability of cochlear implant users to use temporal envelope cues 
recovered from speech frequency modulation. J Acoust Soc Am 2012; 
132:1113-9. 

23) Rubinstein JT, Wilson BS, Finley CC, Abbas PJ. Pseudospontaneous 
activity: stochastic independence of auditory nerve fibers with elec- 
trical stimulation. Hear Res 1999;127:108-18. 

24) Nie K, Stickney G, Zeng FG. Encoding frequency modulation to im- 
prove cochlear implant performance in noise. IEEE Trans Biomed 
Eng 2005;52:64-73. 

25) Li X, Nie K, Imennov NS, Won JH, Drennan WR, Rubinstein JT, et 
al. Improved perception of speech in noise and Mandarin tones with 
acoustic simulations of harmonic coding for cochlear implants. J 
Acoust Soc Am 2012;132:3387-98. 

26) Li X, Nie K, Imennov NS, Rubinstein JT, Atlas LE. Improved per- 
ception of music with a harmonic based algorithm for cochlear im- 
plants. IEEE Trans Neural Syst Rehabil Eng 2013;21:684-94. 

27) Zeng FG. Temporal pitch in electric hearing. Hear Res 2002;174: 
101-6. 

28) Oxenham AJ, Bacon SP. Cochlear compression: perceptual mea- 
sures and implications for normal and impaired hearing. Ear Hear 
2003;24:352-66. 

29) Gatehouse S, Naylor G, Elberling C. Linear and nonlinear hearing aid 
fittings-I. Patterns of benefit. Int J Audiol 2006;45:130-52. 

30) Moore BC. The choice of compression speed in hearing AIDS: the- 
oretical and practical considerations and the role of individual dif- 
ferences. Trends AmpHf 2008;12:103-12. 



6 Korean J Audiol 2014;18(1):1-7 



Moon IJ, et al. 



31) Hopkins K, King A, Moore BC. Tlie effect of compression speed on 
intelligibility: simulated hearing-aid processing with and without 
original temporal fine structure information. J Acoust Soc Am 2012; 
132:1592-601. 

32) Lacher-Fougere S, Demany L. Consequences of cochlear damage for 



the detection of interaural phase differences. J Acoust Soc Am 2005; 
118:2519-26. 

33) Moore BC, Sek A. Development of a fast method for determining 
sensitivity to temporal fine structure. Int J Audiol 2009;48:161-71. 



www.audiology.or.kr 7 



